CITYU-HIF: WSD with Human-Informed Feature Preference

نویسنده

  • Oi Yee Kwong
چکیده

This paper describes our word sense disambiguation (WSD) system participating in the SemEval-2007 tasks. The core system is a fully supervised system based on a Naïve Bayes classifier using multiple knowledge sources. Toward a larger goal of incorporating the intrinsic nature of individual target words in disambiguation, thus introducing a cognitive element in automatic WSD, we tried to fine-tune the results obtained from the core system with humaninformed feature preference, and compared it with automatic feature selection as commonly practised in statistical WSD. Despite the insignificant improvement observed in this preliminary attempt, more systematic analysis remains to be done for a cognitively plausible account of the factors underlying the lexical sensitivity of WSD, which would inform and enhance the development of WSD systems in return.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Modeling Local Interest Points for Semantic Detection and Video Search at TRECVID 2006

Local interest points (LIPs) and their features have been shown to obtain surprisingly good results in object detection and recognition. Its effectiveness and scalability, however, have not been seriously addressed in large-scale multimedia database, for instance TRECVID benchmark. The goal of our works is to investigate the role and performance of LIPs, when coupling with multi-modality featur...

متن کامل

Theme: A Study of Classifier Combination and Semi-Supervised Learning for Word Sense Disambiguation

1. Aims Word Sense Disambiguation (WSD) involves the association of a polysemous word in a text or discourse with a particular sense among numerous potential senses of that word. In my thesis, we present a study of classifier combination and semi-supervised learning for WSD, which aim to boost supervised WSD and improve accuracy of WSD. In addition, we also work on context representation and fe...

متن کامل

Lexical acquisition at the syntax-semantics interface : diathesis alternations, subcategorization frames and selectional preferences

ion Substance Artifact Location NOWSD 0.9 36.4 1.3 1.6 SPass 0.2 54.3 0 5.2 FirstS 1.0 38.6 1.3 0.3 COMB 0.4 54.1 0.5 0 types. PTCMs particularly tended to be less specific than ATCMs. Preference models at the PP slot suffered more from sparse data than subject and direct object slots. This was because the slot is less prevalent to start with, and also because the noun lemmas were considered wi...

متن کامل

Investigating the amount of changes in VO2-Max in response to aerobic exercise among careers of different genotypes of HIF-1α gene

Abstract Introduction: The significant role of HIF-1α gene in adjustment to aerobic exercise has been proved. Although, we have not informed by accurate data about different genotypes of this gene. Therefore, the aim of this study was to investigate the relationship between different genotypes of HIF-1α gene with changes in aerobic capacity following aerobic exercise in obese women. Methods: ...

متن کامل

Knowledge-based Word Sense Disambiguation with Feature Words Based on Dependency Relation and Syntax Tree

Context feature words are important for word sense disambiguation (WSD). There are two kinds of methods to extract feature words: window-based and dependency relation method. Both of them have some defects. In order to solve the problems of the existing methods, this paper proposes a knowledgebased WSD method which obtains context feature words by dependency relation and syntax tree. Firstly, a...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007